Character Recognition in Natural Images
Abstract
This paper tackles the problem of recognizing characters in images of natural scenes. In particular, we focus on recognizing characters in situations that would traditionally not be handled well by OCR techniques. We present an annotated database of images containing English and Kannada characters. The database comprises images of street scenes taken in Bangalore, India, using a standard camera. The problem is addressed in an object categorization framework based on a bag-of-visual-words representation. We assess the performance of various features based on nearest neighbour and SVM classification. It is demonstrated that the performance of the proposed method, using as few as 15 training images, can be far superior to that of commercial OCR systems. Furthermore, the method can benefit from synthetically generated training data, obviating the need for expensive data collection and annotation.
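As a rough illustration of the bag-of-visual-words idea with nearest neighbour classification mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes a visual codebook is already available (in practice it would be learned from training descriptors, for example by clustering), and it uses toy 2-D descriptors. The function names `nearest_index`, `bow_histogram` and `classify_1nn` are hypothetical and chosen only for this example; the features and SVM classifier evaluated in the paper are not reproduced here.

```python
import math


def nearest_index(vector, candidates):
    """Return the index of the candidate closest to vector (Euclidean distance)."""
    return min(range(len(candidates)),
               key=lambda i: math.dist(vector, candidates[i]))


def bow_histogram(descriptors, codebook):
    """Assign each local descriptor to its nearest visual word and
    return an L1-normalized histogram of word counts."""
    counts = [0.0] * len(codebook)
    for descriptor in descriptors:
        counts[nearest_index(descriptor, codebook)] += 1.0
    total = sum(counts)
    return [c / total for c in counts] if total else counts


def classify_1nn(histogram, train_histograms, train_labels):
    """Label a histogram with the label of its nearest training histogram."""
    return train_labels[nearest_index(histogram, train_histograms)]


# Toy data (assumed for illustration only): a 3-word codebook in 2-D.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]

# One training histogram per class; real systems would use many images.
train_histograms = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
train_labels = ["A", "B"]

# Local descriptors extracted from a hypothetical query character image.
query_descriptors = [(0.1, 0.0), (0.9, 1.1), (5.2, 4.8), (0.0, 0.2)]
query_histogram = bow_histogram(query_descriptors, codebook)
predicted = classify_1nn(query_histogram, train_histograms, train_labels)
```

The histogram discards the spatial layout of the descriptors, which is the main simplification a bag-of-visual-words representation makes; the classifier then operates on fixed-length vectors regardless of how many descriptors each image yields.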
Similar resources
Improving Text Recognition in Images of Natural Scenes
Automatic detection and recognition of Malayalam text from natural scene images
In this paper we describe a very simple and efficient method for the detection and recognition of Malayalam text from colour natural scene images taken by a mobile phone camera. Malayalam text detection, skew correction of the detected text, text segmentation and character recognition are the important steps in text understanding from natural scene images. Text understanding in natural scen...
Projection Profile Based Number Plate Localization and Recognition
This paper proposes algorithms to localize vehicle number plates from natural background images, to segment the characters from the localized number plates and to recognize the segmented characters. The reported system is tested on a dataset of 560 sample images captured with different background under various illuminations. The performance accuracy of the proposed system has been calculated at...
Text Localization and Character Extraction in Natural Scene Images using Contourlet Transform and SVM Classifier
The objective of this study is to propose a new method for text region localization and character extraction in natural scene images with complex background. In this paper, a hybrid methodology is suggested which extracts multilingual text from natural scene image with cluttered backgrounds. The proposed approach involves four steps. First, potential text regions in an image are extracted based...
Localization and Recognition of Text with Perspective Distortion in Natural Scenes
Recognizing text in natural scene images refers to the problem of identifying the words present in them. Scene text recognition is very difficult for several reasons: images contain very little linguistic context, different versions of letters and digits must be interpreted, and scene text can appear in any orientation. Most of the existing works are ...
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
This paper describes the COCO-Text dataset. In recent years large-scale datasets like SUN and ImageNet drove the advancement of scene understanding and object recognition. The goal of COCO-Text is to advance state-of-the-art in text detection and recognition in natural images. The dataset is based on the MS COCO dataset, which contains images of complex everyday scenes. The images were not coll...
Publication date: 2009